Extraction and Analysis of Document Examiner Features from Vector Skeletons of Grapheme 'th'

نویسندگان

  • Vladimir Pervouchine
  • Graham Leedham
چکیده

This paper presents a study of 25 structural features extracted from samples of grapheme ‘th’ that correspond to features commonly used by forensic document examiners. Most of the features are extracted using vector skeletons produced by a specially developed skeletonisation algorithm. The methods of feature extraction are presented along with the results. Analysis of the usefulness of the features was conducted and three categories of features were identified: indispensable, partially relevant and irrelevant for determining the authorship of genuine unconstrained handwriting. The division was performed based on searching the optimal feature sets using the wrapper method. A constructive neural network was used as a classifier and a genetic algorithm was used to search for optimal feature sets. It is shown that structural micro features similar to those used in forensic document analysis do possess discriminative power. The results are also compared to those obtained in our preceding study, and it is shown that use of the vector skeletonisation allows both extraction of more structural features and improvement the feature extraction accuracy from 87% to 94%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Document Analysis And Classification Based On Passing Window

In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...

متن کامل

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

Extraction of Suitable Features for Breast Cancer Detection Using Dynamic Analysis of Thermographic Images

Introduction: Thermography is a non-invasive imaging technique that can be used to diagnose breast cancer. In this study, a method was presented for the extraction of suitable features in dynamic thermographic images of breast. The extracted features can help classify thermographic images as cancerous or healthy. Method: In this descriptive-analytical study, the images were taken from the IC/UF...

متن کامل

Extraction of Suitable Features for Breast Cancer Detection Using Dynamic Analysis of Thermographic Images

Introduction: Thermography is a non-invasive imaging technique that can be used to diagnose breast cancer. In this study, a method was presented for the extraction of suitable features in dynamic thermographic images of breast. The extracted features can help classify thermographic images as cancerous or healthy. Method: In this descriptive-analytical study, the images were taken from the IC/UF...

متن کامل

Heart Rate Variability Classification using Support Vector Machine and Genetic Algorithm

Background: Electrocardiogram (ECG) is defined as an electrical signal, which represents cardiac activity. Heart rate variability (HRV) as the variation of interval between two consecutive heartbeats represents the balance between the sympathetic and parasympathetic branches of the autonomic nervous system.Objective: In this study, we aimed to evaluate the efficiency of discrete wavelet transfo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006